Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 557 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 115.6 KiB |
| Average record size in memory | 212.6 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 2 |
| df_index has unique values | Unique |
Reproduction
| Analysis started | 2021-04-05 08:50:20.201323 |
|---|---|
| Analysis finished | 2021-04-05 08:50:30.518632 |
| Duration | 10.32 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
df_index
Real number (ℝ≥0)
| Distinct | 557 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 299.6229803 |
|---|---|
| Minimum | 0 |
| Maximum | 582 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 31.8 |
| Q1 | 159 |
| median | 304 |
| Q3 | 443 |
| 95-th percentile | 554.2 |
| Maximum | 582 |
| Range | 582 |
| Interquartile range (IQR) | 284 |
Descriptive statistics
| Standard deviation | 166.9263764 |
|---|---|
| Coefficient of variation (CV) | 0.5571214074 |
| Kurtosis | -1.163090008 |
| Mean | 299.6229803 |
| Median Absolute Deviation (MAD) | 142 |
| Skewness | -0.07214802646 |
| Sum | 166890 |
| Variance | 27864.41515 |
| Monotonicity | Strictly increasing |
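Several of the figures reported for this variable are derived from the others, so they can be cross-checked by hand. The following is a small sketch of that check; the variable names are illustrative, and the numbers are copied from the tables above together with the 557 observations stated in the dataset statistics.

```python
# Values copied from the tables above (first numeric variable, the index column).
n_observations = 557
mean = 299.6229803
std = 166.9263764
q1, q3 = 159, 443
minimum, maximum = 0, 582

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance is the squared standard deviation
total = mean * n_observations    # sum of all values

print(f"CV={cv:.10f} IQR={iqr} range={value_range}")
print(f"variance={variance:.5f} sum={total:.1f}")
```

The computed values agree with the reported CV, IQR, range, variance and sum up to the rounding of the printed inputs.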
| Value | Count | Frequency (%) |
| 0 | 1 | 0.2% |
| 400 | 1 | 0.2% |
| 394 | 1 | 0.2% |
| 395 | 1 | 0.2% |
| 396 | 1 | 0.2% |
| 397 | 1 | 0.2% |
| 398 | 1 | 0.2% |
| 399 | 1 | 0.2% |
| 401 | 1 | 0.2% |
| 409 | 1 | 0.2% |
| Other values (547) | 547 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 582 | 1 | |
| 581 | 1 | |
| 580 | 1 | |
| 579 | 1 | |
| 578 | 1 | |
| 577 | 1 | |
| 576 | 1 | |
| 575 | 1 | |
| 574 | 1 | |
| 573 | 1 |
Age
Real number (ℝ≥0)
| Distinct | 72 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.95691203 |
|---|---|
| Minimum | 4 |
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 33 |
| median | 45 |
| Q3 | 58 |
| 95-th percentile | 72 |
| Maximum | 90 |
| Range | 86 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 16.2961004 |
|---|---|
| Coefficient of variation (CV) | 0.3624826454 |
| Kurtosis | -0.5710065581 |
| Mean | 44.95691203 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.06452886765 |
| Sum | 25041 |
| Variance | 265.5628883 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 34 | 6.1% |
| 45 | 25 | 4.5% |
| 50 | 23 | 4.1% |
| 32 | 20 | 3.6% |
| 48 | 20 | 3.6% |
| 38 | 19 | 3.4% |
| 42 | 19 | 3.4% |
| 55 | 18 | 3.2% |
| 65 | 17 | 3.1% |
| 46 | 16 | 2.9% |
| Other values (62) | 346 |
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 6 | 1 | 0.2% |
| 7 | 2 | |
| 8 | 1 | 0.2% |
| 10 | 1 | 0.2% |
| 11 | 1 | 0.2% |
| 12 | 2 | |
| 13 | 4 | |
| 14 | 2 | |
| 15 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 90 | 1 | 0.2% |
| 85 | 1 | 0.2% |
| 84 | 1 | 0.2% |
| 78 | 1 | 0.2% |
| 75 | 14 | |
| 74 | 4 | 0.7% |
| 73 | 2 | 0.4% |
| 72 | 6 | |
| 70 | 9 | |
| 69 | 2 | 0.4% |
Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.6 KiB |
| Male | |
|---|---|
| Female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.49551167 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2504 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Male |
| Value | Count | Frequency (%) |
| Male | 419 | |
| Female | 138 | 24.8% |
| Value | Count | Frequency (%) |
| male | 419 | |
| female | 138 | 24.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 695 | |
| a | 557 | |
| l | 557 | |
| M | 419 | |
| F | 138 | 5.5% |
| m | 138 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1947 | |
| Uppercase Letter | 557 | 22.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 695 | |
| a | 557 | |
| l | 557 | |
| m | 138 | 7.1% |
| Value | Count | Frequency (%) |
| M | 419 | |
| F | 138 | 24.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2504 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 695 | |
| a | 557 | |
| l | 557 | |
| M | 419 | |
| F | 138 | 5.5% |
| m | 138 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2504 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 695 | |
| a | 557 | |
| l | 557 | |
| M | 419 | |
| F | 138 | 5.5% |
| m | 138 | 5.5% |
Total_Bilirubin
Real number (ℝ≥0)
| Distinct | 111 |
|---|---|
| Distinct (%) | 19.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.345780969 |
|---|---|
| Minimum | 0.4 |
| Maximum | 75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 0.4 |
|---|---|
| 5-th percentile | 0.6 |
| Q1 | 0.8 |
| median | 1 |
| Q3 | 2.6 |
| 95-th percentile | 16.62 |
| Maximum | 75 |
| Range | 74.6 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 6.328424882 |
|---|---|
| Coefficient of variation (CV) | 1.891464187 |
| Kurtosis | 35.84007349 |
| Mean | 3.345780969 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 4.830356055 |
| Sum | 1863.6 |
| Variance | 40.04896148 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.8 | 87 | |
| 0.7 | 75 | 13.5% |
| 0.9 | 55 | 9.9% |
| 0.6 | 42 | 7.5% |
| 1 | 26 | 4.7% |
| 1.1 | 19 | 3.4% |
| 1.8 | 14 | 2.5% |
| 1.4 | 13 | 2.3% |
| 1.3 | 12 | 2.2% |
| 1.7 | 11 | 2.0% |
| Other values (101) | 203 |
| Value | Count | Frequency (%) |
| 0.4 | 1 | 0.2% |
| 0.5 | 5 | 0.9% |
| 0.6 | 42 | |
| 0.7 | 75 | |
| 0.8 | 87 | |
| 0.9 | 55 | |
| 1 | 26 | 4.7% |
| 1.1 | 19 | 3.4% |
| 1.2 | 8 | 1.4% |
| 1.3 | 12 | 2.2% |
| Value | Count | Frequency (%) |
| 75 | 1 | |
| 42.8 | 1 | |
| 32.6 | 1 | |
| 30.8 | 1 | |
| 30.5 | 2 | |
| 27.7 | 1 | |
| 27.2 | 1 | |
| 26.3 | 1 | |
| 25 | 1 | |
| 23.3 | 1 |
Direct_Bilirubin
Real number (ℝ≥0)
| Distinct | 79 |
|---|---|
| Distinct (%) | 14.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.50951526 |
|---|---|
| Minimum | 0.1 |
| Maximum | 19.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.1 |
| Q1 | 0.2 |
| median | 0.3 |
| Q3 | 1.3 |
| 95-th percentile | 8.5 |
| Maximum | 19.7 |
| Range | 19.6 |
| Interquartile range (IQR) | 1.1 |
Descriptive statistics
| Standard deviation | 2.858843269 |
|---|---|
| Coefficient of variation (CV) | 1.893881661 |
| Kurtosis | 10.88827195 |
| Mean | 1.50951526 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 3.162135724 |
| Sum | 840.8 |
| Variance | 8.172984837 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.2 | 190 | |
| 0.1 | 57 | 10.2% |
| 0.3 | 49 | 8.8% |
| 0.8 | 22 | 3.9% |
| 0.4 | 19 | 3.4% |
| 0.5 | 18 | 3.2% |
| 0.6 | 16 | 2.9% |
| 1 | 13 | 2.3% |
| 1.3 | 12 | 2.2% |
| 1.6 | 11 | 2.0% |
| Other values (69) | 150 |
| Value | Count | Frequency (%) |
| 0.1 | 57 | 10.2% |
| 0.2 | 190 | |
| 0.3 | 49 | 8.8% |
| 0.4 | 19 | 3.4% |
| 0.5 | 18 | 3.2% |
| 0.6 | 16 | 2.9% |
| 0.7 | 11 | 2.0% |
| 0.8 | 22 | 3.9% |
| 0.9 | 5 | 0.9% |
| 1 | 13 | 2.3% |
| Value | Count | Frequency (%) |
| 19.7 | 1 | |
| 18.3 | 1 | |
| 17.1 | 1 | |
| 14.2 | 1 | |
| 14.1 | 1 | |
| 13.7 | 1 | |
| 12.8 | 1 | |
| 12.6 | 2 | |
| 12.1 | 1 | |
| 11.8 | 2 |
Alkaline_Phosphotase
Real number (ℝ≥0)
| Distinct | 261 |
|---|---|
| Distinct (%) | 46.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 292.9802513 |
|---|---|
| Minimum | 63 |
| Maximum | 2110 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 63 |
|---|---|
| 5-th percentile | 135 |
| Q1 | 176 |
| median | 208 |
| Q3 | 298 |
| 95-th percentile | 725.2 |
| Maximum | 2110 |
| Range | 2047 |
| Interquartile range (IQR) | 122 |
Descriptive statistics
| Standard deviation | 247.7258681 |
|---|---|
| Coefficient of variation (CV) | 0.845537769 |
| Kurtosis | 16.95388999 |
| Mean | 292.9802513 |
| Median Absolute Deviation (MAD) | 49 |
| Skewness | 3.69082235 |
| Sum | 163190 |
| Variance | 61368.10572 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 215 | 11 | 2.0% |
| 198 | 11 | 2.0% |
| 298 | 11 | 2.0% |
| 195 | 10 | 1.8% |
| 190 | 10 | 1.8% |
| 182 | 9 | 1.6% |
| 165 | 8 | 1.4% |
| 180 | 8 | 1.4% |
| 188 | 7 | 1.3% |
| 202 | 7 | 1.3% |
| Other values (251) | 465 |
| Value | Count | Frequency (%) |
| 63 | 1 | |
| 75 | 1 | |
| 90 | 1 | |
| 92 | 2 | |
| 97 | 1 | |
| 98 | 1 | |
| 100 | 2 | |
| 102 | 1 | |
| 103 | 1 | |
| 105 | 1 |
| Value | Count | Frequency (%) |
| 2110 | 1 | |
| 1896 | 1 | |
| 1750 | 1 | |
| 1630 | 1 | |
| 1620 | 1 | |
| 1580 | 1 | |
| 1550 | 1 | |
| 1420 | 1 | |
| 1350 | 2 | |
| 1124 | 1 |
Alamine_Aminotransferase
Real number (ℝ≥0)
| Distinct | 148 |
|---|---|
| Distinct (%) | 26.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 78.69658887 |
|---|---|
| Minimum | 10 |
| Maximum | 2000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 14.8 |
| Q1 | 23 |
| median | 35 |
| Q3 | 60 |
| 95-th percentile | 222 |
| Maximum | 2000 |
| Range | 1990 |
| Interquartile range (IQR) | 37 |
Descriptive statistics
| Standard deviation | 180.2557023 |
|---|---|
| Coefficient of variation (CV) | 2.29051481 |
| Kurtosis | 55.05109083 |
| Mean | 78.69658887 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 6.853131303 |
| Sum | 43834 |
| Variance | 32492.11821 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 25 | 4.5% |
| 20 | 21 | 3.8% |
| 22 | 18 | 3.2% |
| 28 | 17 | 3.1% |
| 18 | 17 | 3.1% |
| 21 | 17 | 3.1% |
| 30 | 15 | 2.7% |
| 15 | 14 | 2.5% |
| 24 | 13 | 2.3% |
| 48 | 12 | 2.2% |
| Other values (138) | 388 |
| Value | Count | Frequency (%) |
| 10 | 4 | 0.7% |
| 11 | 2 | 0.4% |
| 12 | 10 | |
| 13 | 4 | 0.7% |
| 14 | 8 | |
| 15 | 14 | |
| 16 | 8 | |
| 17 | 8 | |
| 18 | 17 | |
| 19 | 6 | 1.1% |
| Value | Count | Frequency (%) |
| 2000 | 1 | |
| 1680 | 1 | |
| 1630 | 1 | |
| 1350 | 1 | |
| 1250 | 2 | |
| 950 | 1 | |
| 790 | 1 | |
| 779 | 1 | |
| 622 | 1 | |
| 509 | 1 |
Aspartate_Aminotransferase
Real number (ℝ≥0)
| Distinct | 173 |
|---|---|
| Distinct (%) | 31.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108.8258528 |
|---|---|
| Minimum | 10 |
| Maximum | 4929 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 25 |
| median | 41 |
| Q3 | 86 |
| 95-th percentile | 400.2 |
| Maximum | 4929 |
| Range | 4919 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 292.9194591 |
|---|---|
| Coefficient of variation (CV) | 2.691634861 |
| Kurtosis | 149.6502062 |
| Mean | 108.8258528 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 10.57151891 |
| Sum | 60616 |
| Variance | 85801.80955 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 16 | 2.9% |
| 20 | 14 | 2.5% |
| 30 | 14 | 2.5% |
| 21 | 14 | 2.5% |
| 25 | 13 | 2.3% |
| 28 | 13 | 2.3% |
| 22 | 13 | 2.3% |
| 32 | 12 | 2.2% |
| 24 | 12 | 2.2% |
| 29 | 11 | 2.0% |
| Other values (163) | 425 |
| Value | Count | Frequency (%) |
| 10 | 1 | 0.2% |
| 11 | 2 | 0.4% |
| 12 | 5 | |
| 13 | 3 | 0.5% |
| 14 | 8 | |
| 15 | 11 | |
| 16 | 9 | |
| 17 | 8 | |
| 18 | 9 | |
| 19 | 11 |
| Value | Count | Frequency (%) |
| 4929 | 1 | 0.2% |
| 2946 | 1 | 0.2% |
| 1600 | 1 | 0.2% |
| 1500 | 1 | 0.2% |
| 1050 | 2 | |
| 960 | 1 | 0.2% |
| 950 | 1 | 0.2% |
| 850 | 4 | |
| 844 | 1 | 0.2% |
| 794 | 1 | 0.2% |
Total_Protiens
Real number (ℝ≥0)
| Distinct | 58 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.51005386 |
|---|---|
| Minimum | 2.7 |
| Maximum | 9.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 2.7 |
|---|---|
| 5-th percentile | 4.6 |
| Q1 | 5.8 |
| median | 6.6 |
| Q3 | 7.2 |
| 95-th percentile | 8.12 |
| Maximum | 9.6 |
| Range | 6.9 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 1.091105059 |
|---|---|
| Coefficient of variation (CV) | 0.167603077 |
| Kurtosis | 0.3020396956 |
| Mean | 6.51005386 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.3374894565 |
| Sum | 3626.1 |
| Variance | 1.190510249 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 32 | 5.7% |
| 6.8 | 26 | 4.7% |
| 6 | 26 | 4.7% |
| 6.9 | 25 | 4.5% |
| 6.2 | 24 | 4.3% |
| 7.1 | 22 | 3.9% |
| 8 | 20 | 3.6% |
| 7.2 | 19 | 3.4% |
| 7.3 | 18 | 3.2% |
| 6.1 | 18 | 3.2% |
| Other values (48) | 327 |
| Value | Count | Frequency (%) |
| 2.7 | 1 | 0.2% |
| 2.8 | 1 | 0.2% |
| 3 | 1 | 0.2% |
| 3.6 | 3 | |
| 3.7 | 1 | 0.2% |
| 3.8 | 2 | |
| 3.9 | 2 | |
| 4 | 2 | |
| 4.1 | 2 | |
| 4.3 | 3 |
| Value | Count | Frequency (%) |
| 9.6 | 1 | 0.2% |
| 9.5 | 1 | 0.2% |
| 9.2 | 2 | 0.4% |
| 8.9 | 1 | 0.2% |
| 8.7 | 1 | 0.2% |
| 8.6 | 3 | 0.5% |
| 8.5 | 5 | |
| 8.4 | 3 | 0.5% |
| 8.3 | 3 | 0.5% |
| 8.2 | 8 |
Albumin
Real number (ℝ≥0)
| Distinct | 40 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.156373429 |
|---|---|
| Minimum | 0.9 |
| Maximum | 5.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 0.9 |
|---|---|
| 5-th percentile | 1.8 |
| Q1 | 2.6 |
| median | 3.1 |
| Q3 | 3.8 |
| 95-th percentile | 4.4 |
| Maximum | 5.5 |
| Range | 4.6 |
| Interquartile range (IQR) | 1.2 |
Descriptive statistics
| Standard deviation | 0.7980978672 |
|---|---|
| Coefficient of variation (CV) | 0.2528528025 |
| Kurtosis | -0.3566704456 |
| Mean | 3.156373429 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | -0.07879703319 |
| Sum | 1758.1 |
| Variance | 0.6369602056 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 43 | 7.7% |
| 4 | 37 | 6.6% |
| 2.9 | 29 | 5.2% |
| 3.1 | 26 | 4.7% |
| 3.2 | 26 | 4.7% |
| 3.9 | 25 | 4.5% |
| 3.5 | 23 | 4.1% |
| 2.5 | 22 | 3.9% |
| 3.4 | 21 | 3.8% |
| 3.3 | 21 | 3.8% |
| Other values (30) | 284 |
| Value | Count | Frequency (%) |
| 0.9 | 2 | 0.4% |
| 1 | 1 | 0.2% |
| 1.4 | 3 | 0.5% |
| 1.5 | 3 | 0.5% |
| 1.6 | 8 | |
| 1.7 | 3 | 0.5% |
| 1.8 | 12 | |
| 1.9 | 7 | |
| 2 | 17 | |
| 2.1 | 14 |
| Value | Count | Frequency (%) |
| 5.5 | 2 | 0.4% |
| 5 | 1 | 0.2% |
| 4.9 | 4 | 0.7% |
| 4.8 | 2 | 0.4% |
| 4.7 | 3 | 0.5% |
| 4.6 | 4 | 0.7% |
| 4.5 | 6 | |
| 4.4 | 8 | |
| 4.3 | 12 | |
| 4.2 | 12 |
Albumin_and_Globulin_Ratio
Real number (ℝ≥0)
| Distinct | 70 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9489873418 |
|---|---|
| Minimum | 0.3 |
| Maximum | 2.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.5 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 0.7 |
| median | 0.95 |
| Q3 | 1.1 |
| 95-th percentile | 1.5 |
| Maximum | 2.8 |
| Range | 2.5 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.318525811 |
|---|---|
| Coefficient of variation (CV) | 0.3356481135 |
| Kurtosis | 3.476751904 |
| Mean | 0.9489873418 |
| Median Absolute Deviation (MAD) | 0.15 |
| Skewness | 1.014359717 |
| Sum | 528.5859494 |
| Variance | 0.1014586923 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 102 | |
| 0.8 | 59 | |
| 0.9 | 55 | |
| 0.7 | 53 | |
| 1.1 | 44 | 7.9% |
| 1.2 | 35 | 6.3% |
| 0.6 | 31 | 5.6% |
| 1.3 | 25 | 4.5% |
| 0.5 | 23 | 4.1% |
| 1.4 | 17 | 3.1% |
| Other values (60) | 113 |
| Value | Count | Frequency (%) |
| 0.3 | 4 | 0.7% |
| 0.35 | 1 | 0.2% |
| 0.37 | 1 | 0.2% |
| 0.39 | 1 | 0.2% |
| 0.4 | 14 | |
| 0.45 | 1 | 0.2% |
| 0.46 | 1 | 0.2% |
| 0.47 | 2 | 0.4% |
| 0.48 | 1 | 0.2% |
| 0.5 | 23 |
| Value | Count | Frequency (%) |
| 2.8 | 1 | 0.2% |
| 2.5 | 2 | |
| 1.9 | 1 | 0.2% |
| 1.85 | 2 | |
| 1.8 | 3 | |
| 1.72 | 1 | 0.2% |
| 1.7 | 4 | |
| 1.66 | 1 | 0.2% |
| 1.6 | 3 | |
| 1.58 | 2 |
Liver_Disease
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.7 KiB |
| Liver Disease | |
|---|---|
| No Liver Disease |
Length
| Max length | 16 |
|---|---|
| Median length | 13 |
| Mean length | 13.86714542 |
| Min length | 13 |
Characters and Unicode
| Total characters | 7724 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Liver Disease |
|---|---|
| 2nd row | Liver Disease |
| 3rd row | Liver Disease |
| 4th row | Liver Disease |
| 5th row | Liver Disease |
| Value | Count | Frequency (%) |
| Liver Disease | 396 | |
| No Liver Disease | 161 |
| Value | Count | Frequency (%) |
| disease | 557 | |
| liver | 557 | |
| no | 161 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1671 | |
| i | 1114 | |
| s | 1114 | |
| (space) | 718 | |
| L | 557 | 7.2% |
| v | 557 | 7.2% |
| r | 557 | 7.2% |
| D | 557 | 7.2% |
| a | 557 | 7.2% |
| N | 161 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5731 | |
| Uppercase Letter | 1275 | 16.5% |
| Space Separator | 718 | 9.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 1671 | |
| i | 1114 | |
| s | 1114 | |
| v | 557 | 9.7% |
| r | 557 | 9.7% |
| a | 557 | 9.7% |
| o | 161 | 2.8% |
| Value | Count | Frequency (%) |
| L | 557 | |
| D | 557 | |
| N | 161 | 12.6% |
| Value | Count | Frequency (%) |
| (space) | 718 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7006 | |
| Common | 718 | 9.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 1671 | |
| i | 1114 | |
| s | 1114 | |
| L | 557 | 8.0% |
| v | 557 | 8.0% |
| r | 557 | 8.0% |
| D | 557 | 8.0% |
| a | 557 | 8.0% |
| N | 161 | 2.3% |
| o | 161 | 2.3% |
| Value | Count | Frequency (%) |
| (space) | 718 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7724 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 1671 | |
| i | 1114 | |
| s | 1114 | |
| (space) | 718 | |
| L | 557 | 7.2% |
| v | 557 | 7.2% |
| r | 557 | 7.2% |
| D | 557 | 7.2% |
| a | 557 | 7.2% |
| N | 161 | 2.1% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
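The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report generator runs; the function name `pearson_r` and the sample values are assumptions made for the example and are not taken from the profiled dataset.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # The 1/n factors cancel between numerator and denominator.
    return cov / sqrt(var_x * var_y)


# Illustrative values only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shifting either variable and rescaling it by a positive factor leaves r unchanged.
r_rescaled = pearson_r([10 * v + 3 for v in x], [0.5 * v - 7 for v in y])
print(r, r_rescaled)
```

The two printed values agree up to floating-point rounding, which is the location and scale invariance mentioned above.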
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
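As a rough sketch of this definition, ρ can be computed by ranking both variables and applying Pearson's formula to the ranks. The helper names below (`average_ranks`, `pearson_r`, `spearman_rho`) and the sample data are assumptions for illustration; ties receive the mean of the positions they occupy, which is one common convention.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # sorted positions i..j share one rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Illustrative values only: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

print(spearman_rho(x, y), pearson_r(x, y))
```

For this monotonic but nonlinear relationship, ρ reaches its maximum while Pearson's r stays below it.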
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
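The pair-counting formula as worded above corresponds to the variant usually called tau-a. The sketch below implements exactly that wording; the function name `kendall_tau_a` and the sample data are assumptions for illustration. Library implementations commonly use a tie-adjusted variant (tau-b), so their results can differ from this sketch when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        product = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if product > 0:
            concordant += 1  # both variables order the pair the same way
        elif product < 0:
            discordant += 1  # the variables order the pair in opposite ways
        # product == 0 means a tie in x or y: counted as neither
    return (concordant - discordant) / len(pairs)


# Illustrative values only.
x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]

tau = kendall_tau_a(x, y)
print(tau)
```

Of the 10 pairs in this example, 7 are concordant and 3 are discordant, giving τ = (7 - 3) / 10 = 0.4.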
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | df_index | Age | Gender | Total_Bilirubin | Direct_Bilirubin | Alkaline_Phosphotase | Alamine_Aminotransferase | Aspartate_Aminotransferase | Total_Protiens | Albumin | Albumin_and_Globulin_Ratio | Liver_Disease |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 65 | Female | 0.7 | 0.1 | 187 | 16 | 18 | 6.8 | 3.3 | 0.90 | Liver Disease |
| 1 | 1 | 62 | Male | 10.9 | 5.5 | 699 | 64 | 100 | 7.5 | 3.2 | 0.74 | Liver Disease |
| 2 | 2 | 62 | Male | 7.3 | 4.1 | 490 | 60 | 68 | 7.0 | 3.3 | 0.89 | Liver Disease |
| 3 | 3 | 58 | Male | 1.0 | 0.4 | 182 | 14 | 20 | 6.8 | 3.4 | 1.00 | Liver Disease |
| 4 | 4 | 72 | Male | 3.9 | 2.0 | 195 | 27 | 59 | 7.3 | 2.4 | 0.40 | Liver Disease |
| 5 | 5 | 46 | Male | 1.8 | 0.7 | 208 | 19 | 14 | 7.6 | 4.4 | 1.30 | Liver Disease |
| 6 | 6 | 26 | Female | 0.9 | 0.2 | 154 | 16 | 12 | 7.0 | 3.5 | 1.00 | Liver Disease |
| 7 | 7 | 29 | Female | 0.9 | 0.3 | 202 | 14 | 11 | 6.7 | 3.6 | 1.10 | Liver Disease |
| 8 | 8 | 17 | Male | 0.9 | 0.3 | 202 | 22 | 19 | 7.4 | 4.1 | 1.20 | No Liver Disease |
| 9 | 9 | 55 | Male | 0.7 | 0.2 | 290 | 53 | 58 | 6.8 | 3.4 | 1.00 | Liver Disease |
Last rows
| | df_index | Age | Gender | Total_Bilirubin | Direct_Bilirubin | Alkaline_Phosphotase | Alamine_Aminotransferase | Aspartate_Aminotransferase | Total_Protiens | Albumin | Albumin_and_Globulin_Ratio | Liver_Disease |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 547 | 573 | 32 | Male | 3.7 | 1.6 | 612 | 50 | 88 | 6.2 | 1.9 | 0.40 | Liver Disease |
| 548 | 574 | 32 | Male | 12.1 | 6.0 | 515 | 48 | 92 | 6.6 | 2.4 | 0.50 | Liver Disease |
| 549 | 575 | 32 | Male | 25.0 | 13.7 | 560 | 41 | 88 | 7.9 | 2.5 | 2.50 | Liver Disease |
| 550 | 576 | 32 | Male | 15.0 | 8.2 | 289 | 58 | 80 | 5.3 | 2.2 | 0.70 | Liver Disease |
| 551 | 577 | 32 | Male | 12.7 | 8.4 | 190 | 28 | 47 | 5.4 | 2.6 | 0.90 | Liver Disease |
| 552 | 578 | 60 | Male | 0.5 | 0.1 | 500 | 20 | 34 | 5.9 | 1.6 | 0.37 | No Liver Disease |
| 553 | 579 | 40 | Male | 0.6 | 0.1 | 98 | 35 | 31 | 6.0 | 3.2 | 1.10 | Liver Disease |
| 554 | 580 | 52 | Male | 0.8 | 0.2 | 245 | 48 | 49 | 6.4 | 3.2 | 1.00 | Liver Disease |
| 555 | 581 | 31 | Male | 1.3 | 0.5 | 184 | 29 | 32 | 6.8 | 3.4 | 1.00 | Liver Disease |
| 556 | 582 | 38 | Male | 1.0 | 0.3 | 216 | 21 | 24 | 7.3 | 4.4 | 1.50 | No Liver Disease |